The research problems studied during the first year of this grant were
concerned with sparse matrix technology in structural analysis, least
squares geodetic adjustments and related problems in science and
engineering. The objectives of this research were to develop fast
numerical algorithms for the efficient solution to large-scale systems
of linear equations Ax = b and least squares problems
min_x ||b - Ax||_2, where A is large and sparse and often has a special
structured form.

One particular project to be undertaken involved the development and
testing of serial and parallel algorithms for null space computations
associated with the efficient implementation of the force method for the
stress analysis of large-scale structures. A second major project involved
the development of direct-iterative methods for large sparse least squares
and constrained minimization problems associated with structural analysis
and with the adjustment of large amounts of geodetic data. The major
results obtained thus far on these projects are outlined in the next
section.


II. SUMMARY OF MAJOR RESULTS


The most important research accomplishments by the principal
investigator and co-workers are described below. These results have been
obtained on five specific problems in computational mathematics and
applications.

1. Sparse Orthogonal Schemes for Structural Optimization using the
Force Method.


Historically there are two principal methods of matrix structural
analysis, the displacement (or stiffness) method and
the force (or flexibility) method. In recent times the force
method has been used relatively little because the displacement
method has been deemed easier to implement on digital
computers, especially for large sparse systems. The force
method has theoretical advantages, however, for multiple
redesign problems or nonlinear elastic analysis because it
allows the solution of modified problems without restarting
the computation from the beginning. In this work an
implementation of the force method is given which is numerically
stable and preserves sparsity. Although it is motivated
by earlier elimination schemes, in this approach each of
the two main phases of the force method is carried out using
orthogonal factorization techniques recently developed for
linear least squares problems by George, Golub, Heath and
Plemmons.
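
The two phases can be illustrated with a small dense sketch. It assumes
the usual statement of the problem, min f^T F f subject to E f = p, with
F the element flexibility matrix, E the equilibrium matrix and p the load
vector. The function name force_method and the use of dense NumPy
routines are assumptions made for illustration; the sketch does not
preserve sparsity and is not the implementation described above.

```python
import numpy as np

def force_method(E, F, p):
    """Solve min f^T F f subject to E f = p by a null space approach."""
    m, n = E.shape
    # Phase 1: one full QR factorization of E^T yields a particular
    # solution f_p (E f_p = p) and an orthonormal null space basis B.
    Q, R = np.linalg.qr(E.T, mode="complete")   # E^T = Q R, Q is n x n
    Q1, B = Q[:, :m], Q[:, m:]                  # E B = 0
    f_p = Q1 @ np.linalg.solve(R[:m].T, p)      # minimum-norm solution
    # Phase 2: minimize over f = f_p + B x, which gives the reduced
    # system (B^T F B) x = -B^T F f_p.
    x = np.linalg.solve(B.T @ F @ B, -B.T @ F @ f_p)
    return f_p + B @ x

# Hypothetical data: 3 equilibrium equations, 7 member forces.
rng = np.random.default_rng(0)
E = rng.standard_normal((3, 7))
F = np.diag(rng.uniform(1.0, 2.0, size=7))     # positive definite
p = rng.standard_normal(3)
f = force_method(E, F, p)
print(np.linalg.norm(E @ f - p))
```

Because B depends only on E, a change in F (a redesign) requires only
phase two to be repeated, which is the advantage noted above.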


2. An Algorithm to Compute a Sparse Basis of the Null Space.

Let A be a real m x n matrix with full row rank m. In many
algorithms in engineering and science, such as the force
method in structural analysis, the dual variable method
for the Navier-Stokes equations or more generally null space
methods in quadratic programming, it is necessary to compute
a basis matrix B for the null space of A. Here B is n x r,
r = n - m, of rank r, with AB = 0. In many instances A is
large and sparse and often banded. The purpose of this
work was to describe and test a variation of a method
originally suggested in Germany and called the turnback algorithm
for computing a banded basis matrix B. Two implementations
of the algorithm were developed, one using Gaussian
elimination and the other using orthogonal factorization by
Givens rotations. The FORTRAN software was executed on an
IBM 3081 mainframe computer with an FPS-164 attached array
processor at the Triangle Universities Computing Center
near Raleigh, N.C. Tests were run on a variety of structural
analysis problems including two- and three-dimensional
frames, plane stress, plate bending and mixed finite element
problems. The results indicate that both
implementations of the algorithm yielded a well-conditioned,
banded basis matrix B when A is well conditioned, but that
the orthogonal scheme yielded a better conditioned B for
large, ill-conditioned problems.
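
The defining properties of B (size n x (n - m), rank n - m, AB = 0) can
be checked on a small example. The sketch below is not the turnback
algorithm and does not produce a banded B; it only contrasts an
elimination-based basis with an orthogonal one on a dense copy of a
banded matrix. The function names and the test matrix are hypothetical,
and the elimination variant assumes that the leading m x m block of A is
nonsingular.

```python
import numpy as np

def null_basis_elimination(A):
    """Basis B = [-A1^{-1} A2; I], assuming the leading m x m block A1
    of A is nonsingular (no pivoting is attempted)."""
    m, n = A.shape
    A1, A2 = A[:, :m], A[:, m:]
    return np.vstack([-np.linalg.solve(A1, A2), np.eye(n - m)])

def null_basis_orthogonal(A):
    """Orthonormal basis taken from a full QR factorization of A^T."""
    m, _ = A.shape
    Q, _ = np.linalg.qr(A.T, mode="complete")
    return Q[:, m:]          # last n - m columns of Q satisfy A Q2 = 0

# Small banded matrix with full row rank (hypothetical data).
m, n = 6, 8
A = np.zeros((m, n))
for i in range(m):
    A[i, i], A[i, i + 1], A[i, i + 2] = 2.0, -1.0, 1.0

B_elim = null_basis_elimination(A)
B_orth = null_basis_orthogonal(A)
print(np.linalg.norm(A @ B_elim), np.linalg.norm(A @ B_orth))
print(np.linalg.cond(B_elim), np.linalg.cond(B_orth))
```

The orthogonal basis has orthonormal columns and therefore condition
number 1, while the conditioning of the elimination basis depends on A1.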


3. A Direct-Iterative Method for Large-scale Least Squares Problems.

In 1975 Chen and Gentleman suggested a 3-block SOR method
for solving least squares problems, based on a partitioning
of the observation matrix A into

    A = [ A_1 ]
        [ A_2 ]

where A_1 is square and nonsingular. In many cases A_1 is
obvious from the nature of the problem. This combined
direct-iterative method was discussed further and applied to
angle adjustment problems in geodesy, where A_1 is easily formed
and is large and sparse, by Plemmons in 1979 for the Army
Research Office. Recently, Niethammer, de Pillis and Varga
have rekindled interest in this method by correcting and
extending the SOR convergence interval. The purpose of
our work here was to provide an alternate formulation of
the problem leading to a 2-block SOR method. For this
formulation it was shown that the resulting direct-iterative
method always converges for a sufficiently small SOR parameter,
in contrast to the 3-block formulation. Formulas for the
optimum SOR parameter and the resulting asymptotic convergence
factor were derived. Furthermore, it was shown that
this 2-cyclic block SOR method always gives better convergence
results than the 3-cyclic case for the same amount of
work per iteration.
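
A 2-block iteration of this kind can be sketched as follows. The report
does not spell out the formulation, so the splitting used here is an
assumption: with A partitioned into a square nonsingular block A_1 above
a block A_2, and b partitioned conformally into b_1 and b_2, the
residual block r_1 is eliminated from the normal equations and SOR is
applied to the remaining unknowns x and r_2 = b_2 - A_2 x. The
relaxation parameter 2 / (1 + sqrt(1 + beta^2)), with beta the 2-norm of
A_2 A_1^{-1}, is the classical choice from general SOR theory for
2-cyclic matrices whose Jacobi eigenvalues are purely imaginary; it is
not quoted from this report. All names and data are hypothetical.

```python
import numpy as np

def two_block_sor(A1, A2, b1, b2, omega, iters=500):
    """2-block SOR on the unknowns (x, r2), where r2 = b2 - A2 x."""
    x = np.zeros(A1.shape[1])
    r2 = np.zeros(A2.shape[0])
    for _ in range(iters):
        # Direct part: two solves with the square block A1 give
        # A1^{-1} b1 + (A1^T A1)^{-1} A2^T r2.
        t = np.linalg.solve(A1, b1 + np.linalg.solve(A1.T, A2.T @ r2))
        x = (1 - omega) * x + omega * t
        # The second block uses the freshly updated x.
        r2 = (1 - omega) * r2 + omega * (b2 - A2 @ x)
    return x

rng = np.random.default_rng(0)
n, k = 5, 3
A1 = 4 * np.eye(n) + 0.5 * rng.standard_normal((n, n))
A2 = rng.standard_normal((k, n))
b1, b2 = rng.standard_normal(n), rng.standard_normal(k)

beta = np.linalg.norm(A2 @ np.linalg.inv(A1), 2)
omega = 2 / (1 + np.sqrt(1 + beta**2))
x = two_block_sor(A1, A2, b1, b2, omega)

# Reference solution of min ||b - A x||_2 from a dense solver.
x_ref = np.linalg.lstsq(np.vstack([A1, A2]),
                        np.concatenate([b1, b2]), rcond=None)[0]
print(np.linalg.norm(x - x_ref))
```

In practice A_1 would be factored once and the factors reused in every
sweep; the repeated calls to a dense solver are only for brevity.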


4. Applications of a Block SOR Method to a Class of Constrained
Minimization Problems.

In this work it is shown how a block successive overrelaxation
direct-iterative method can be applied to the solution
of a class of linear equality-constrained quadratic programming
problems. The scheme is similar in nature to those studied
recently by de Pillis, Niethammer and Varga and by Markham,
Neumann and Plemmons for solving large sparse least squares
problems. It is based upon a partitioning of the
fundamental matrix into a block consistently ordered 2-cyclic
form where the nonzero eigenvalues of the Jacobi matrix are
all pure imaginary. Applications of the method to structural
optimization problems have been developed in this
research.


5. Computing a Banded Basis of the Null Space on the Denelcor HEP
Multiprocessor.

The banded null space basis problem is the following:
given an m x n banded matrix A of rank m < n, find a
banded n x (n - m) matrix B whose columns span the
null space of A, i.e., find a banded B of rank n - m
with AB = 0. This problem arises in the design of practical
algorithms for large-scale numerical optimization problems
in engineering and science, such as the force method
in structural analysis, the dual variable method for
solving the Navier-Stokes equations in fluid mechanics
or more general problems in linearly and nonlinearly
constrained minimization.

The purpose of this work is the design and
testing of a parallel version of a highly effective but
computationally intensive turnback algorithm for computing a banded
basis matrix B. The turnback algorithm is shown to
exhibit a moderate amount of modular parallelism. FORTRAN
codes for both the serial and parallel versions of the
algorithm have been implemented on the Denelcor HEP
computer at the Argonne National Laboratory, an MIMD
multiprocessing machine. Performance results using some practical
data from structural analysis problems indicate that our
parallel implementation executes significantly faster than
the serial implementation.


III. RESEARCH IN PROGRESS

Two research projects in support of this grant are currently underway
and are briefly described below. A more complete description of this current
work will be given in next year's annual scientific report.

1. Accurate Finite Element Calculations for Large-Scale Structures.

Here a new approach to calculating the vector f of internal
forces and the associated stresses, given a finite element
model of a structure and an external load vector p, is
being undertaken. This approach is based upon solving
the fundamental problem of linear elastic analysis

    min f^T F f   subject to   E f = p

where F denotes the element flexibility matrix and E the
equilibrium matrix, as a linearly constrained least squares
problem. This least squares problem is then solved by
a weighted least squares scheme using an iterative improvement
scheme developed recently by C. Van Loan. Numerical
comparisons are currently being made with the natural factor
formulation of this problem in terms of accuracy and numerical
efficiency. This work is joint with C. Van Loan in the
Computer Science Department at Cornell University.
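
The idea of replacing the constrained problem by a weighted least
squares problem can be sketched as follows. The sketch assumes a factor
G with F = G^T G, so that f^T F f = ||G f||_2^2, and simply multiplies
the constraint rows by a large weight mu. It shows only the basic
weighting step; the iterative improvement scheme mentioned above is not
reproduced, and the function name, the weight and the data are
hypothetical.

```python
import numpy as np

def weighted_constrained_lstsq(E, G, p, mu=1e5):
    """Approximate min ||G f||_2 subject to E f = p by giving the
    constraint rows a large weight mu in an ordinary least squares
    problem."""
    M = np.vstack([mu * E, G])
    rhs = np.concatenate([mu * p, np.zeros(G.shape[0])])
    return np.linalg.lstsq(M, rhs, rcond=None)[0]

# Hypothetical data: 3 equilibrium equations, 7 internal forces.
rng = np.random.default_rng(0)
m, n = 3, 7
E = rng.standard_normal((m, n))
G = 2 * np.eye(n) + 0.2 * rng.standard_normal((n, n))
p = rng.standard_normal(m)

f = weighted_constrained_lstsq(E, G, p)
print(np.linalg.norm(E @ f - p))     # constraint residual
```

A larger weight enforces the constraint more closely but worsens the
conditioning of the stacked matrix, which is the trade-off an iterative
improvement step is meant to address.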

2. A Parallel Version of the Golub/Plemmons Algorithm for the Orthogonal
Factorization of Least Squares Observation Matrices in Block Angular
Form.

The Golub/Plemmons block orthogonal factorization algorithm
was designed for the stable least squares adjustment of
large amounts of geodetic data which are assembled so that
the observation matrix has a bordered block diagonal form. The
algorithm also applies to more general substructuring or
domain decomposition methods. This work investigates
how a parallel version of the algorithm can
be applied not only to the factorization scheme but also
to the back substitution phase of the least squares method.
The algorithm is being programmed for testing on the Denelcor
HEP multiprocessing computer at the Argonne National Lab.
This is joint work with A. Sameh in the Computer Science
Department at the University of Illinois.
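
The structure that makes a parallel version natural can be seen in a
serial sketch of a generic block angular least squares solve. It assumes
block rows of the form [0 ... A_i ... 0 | B_i], with local unknowns x_i
and shared border unknowns y, each A_i having full column rank. This is
a generic illustration with hypothetical names and dense NumPy routines,
not the Golub/Plemmons code.

```python
import numpy as np

def block_angular_lstsq(A_blocks, B_blocks, b_blocks):
    """Least squares for block rows [0 .. A_i .. 0 | B_i] [x; y] ~ b_i."""
    reduced, T_rows, d_rows = [], [], []
    # Factorization phase: each diagonal block is reduced independently,
    # so the iterations of this loop could run in parallel.
    for A_i, B_i, b_i in zip(A_blocks, B_blocks, b_blocks):
        n_i = A_i.shape[1]
        Q, R = np.linalg.qr(A_i, mode="complete")
        S, c = Q.T @ B_i, Q.T @ b_i
        reduced.append((R[:n_i], S[:n_i], c[:n_i]))
        T_rows.append(S[n_i:])
        d_rows.append(c[n_i:])
    # Border phase: the leftover rows determine the shared unknowns y.
    y = np.linalg.lstsq(np.vstack(T_rows), np.concatenate(d_rows),
                        rcond=None)[0]
    # Back substitution phase: again independent across blocks.
    xs = [np.linalg.solve(R_i, c_i - S_i @ y) for R_i, S_i, c_i in reduced]
    return xs, y

# Hypothetical data: two diagonal blocks and a border of width 2.
rng = np.random.default_rng(1)
A_blocks = [rng.standard_normal((6, 3)), rng.standard_normal((5, 2))]
B_blocks = [rng.standard_normal((6, 2)), rng.standard_normal((5, 2))]
b_blocks = [rng.standard_normal(6), rng.standard_normal(5)]
xs, y = block_angular_lstsq(A_blocks, B_blocks, b_blocks)
print([x.shape for x in xs], y.shape)
```

Only the border phase couples the blocks; both the factorization loop
and the back substitution are independent per block.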